Exploring Approaches to Discriminating among Near-Synonyms

نویسندگان

  • Mary Gardiner
  • Mark Dras
چکیده

Near-synonyms are words that mean approximately the same thing, and which tend to be assigned to the same leaf in ontologies such as WordNet. However, they can differ from each other subtly in both meaning and usage—consider the pair of nearsynonyms frugal and stingy—and therefore choosing the appropriate near-synonym for a given context is not a trivial problem. Initial work by Edmonds (1997) suggested that corpus statistics methods would not be particularly effective, and led to subsequent work adopting methods based on specific lexical resources. In earlier work (Gardiner and Dras, 2007) we discussed the hypothesis that some kind of corpus statistics approach may still be effective in some situations, particularly if the near-synonyms differ in sentiment from each other, and we presented some preliminary confirmation of the truth of this hypothesis. This suggests that problems involving this type of nearsynonym may be particularly amenable to corpus statistics methods. In this paper we investigate whether this result extends to a different corpus statistics method and in addition we analyse the results with respect to a possible confounding factor discussed in the previous work: the skewness of the sets of near synonyms. Our results show that the relationship between success in prediction and the nature of the near-synonyms is method dependent and that skewness is a more significant factor.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corpus Statistics Approaches to Discriminating Among Near-Synonyms

Near-synonyms are words that mean approximately the same thing, and which tend to be assigned to the same leaf in ontologies such as WordNet. However, they can differ from each other subtly in both meaning and usage—consider the pair of near-synonyms frugal and stingy— and therefore choosing the appropriate near-synonym for a given context is not a trivial problem. Early work on near-synonyms w...

متن کامل

Developing a Computer-facilitated Tool for Acquiring Near-synonyms in Chinese and English (short paper)

This paper is a multi-disciplinary study on cognitive linguistics, computational linguistics and language acquisition. It focuses on application issues of meaning, semantic structures and pragmatics to near-synonyms in Chinese and English languages. The near-synonyms of physical action verbs (PA Verbs) can be distinctive from each other in the way in which their actions are depicted, but in ter...

متن کامل

Comparative Semantics of Sin in Quran and Sociology

The specialists’ concerns in the field of culture regarding the imported sciences and negligence or perhaps moral nihilism exposure of a considerable portion of the Islamic society to sins shows the importance of exploring the meaning and concept of sin in the religious literature (Quran) in comparison with the equivalent idiomatic concept of sin in sociology and obtaining the similarities and ...

متن کامل

Near-synonymy and the structure of lexical knowledge

Plesionyms, or near-synonyms, are words that are almost synonyms, but not quite. The need to deal adequately with plesionymy in tasks such as lexical choice is the basis for two alternatives to conventional models of lexical knowledge: a Saussurean approach and a prototype-theory approach. In this paper, I will discuss these approaches, showing that the latter is troublesome but the former is l...

متن کامل

Single Base Extension and Fourier-Transform Infra-Red Spectroscopy Techniques; Further Approaches in Discriminating Hazelnut-Adulterated Olive Oil

  Background:Confirmation of olive oil authenticity and particularly virgin olive oil has a great importance. Several advanced chemical and genetic analyses have been used to monitor especial components; however each has its limitations especially when detecting hazelnut-adulterated olive oil. Objectives:The objective of this research is to assess the presence of trace amoun...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007